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Abstract 

We prove the quasi-optimal convergence of a standard adaptive finite element method (AFEM) 
for nonlinear elliptic second-order equations of monotone type. The adaptive algorithm is based on 
, residual-type a posteriori error estimators and Dorfler's strategy is assumed for marking. We first 

prove a contraction property for a suitable definition of total error, which is equivalent to the total 
| error as defined by Cascon et al. [2], and implies linear convergence of the algorithm. Secondly, we 

. use this contraction to derive the optimal cardinality of the AFEM. 

' Keywords: nonlinear elliptic equations; adaptive finite element methods; optimality. 
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1 Introduction 

The main goal of this article is the study of convergence and optimality properties of an adaptive finite el- 
ement method (AFEM) for quasi- linear elliptic partial differential equations over a polygonal/polyhedral 
domain Q, cR d (d = 2, 3) having the form 

J-V- [a(-,\Vu\ 2 )Vu] =/ inO 

u = on 90, 

where a : 51 x R + — > R + is a bounded positive function whose precise properties will be stated in 
Section 2 below, and / G L 2 (Q) is given. This kind of problems arise in many practical situations, 
for example, in shock-free airfoil design, seepage through coarse grained porous media, and in some 
glaciological problems [3]. 

AFEMs are an effective tool for making an efficient use of the computational resources, and for 
certain problems, it is even indispensable to their numerical resolvability. The ultimate goal of AFEMs 
is to equidistribute the error and the computational effort obtaining a sequence of meshes with optimal 
complexity Adaptive methods are based on a posteriori error estimators, that are computable quantities 
depending on the discrete solution and data, and indicate a distribution of the error. A quite popular, 
natural adaptive version of classical finite element methods consists of the loop 
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Solve -> Estimate — s- Mark — > Refine, (2) 

that is: solve for the finite element solution on the current grid, compute the a posteriori error estimator, 
mark with its help elements to be subdivided, and refine the current grid into a new, finer one. 

A general result of convergence for linear problems has been obtained by Morin, Siebcrt and Vccser [13], 
where very general conditions on the linear problems and the adaptive methods that guarantee conver- 
gence are stated. Following these ideas a (plain) convergence result for elliptic eigenvalue problems has 
been proved in [7]. On the other hand, optimality of adaptive methods using Dorfler's marking strat- 
egy^] for linear elliptic problems has been stated by Stevcnson[18] and Cascon, Kreuzer, Nochetto and 
Siebcrt [2]. Linear convergence of an AFEM for elliptic eigenvalue problems has been proved in [12], and 
optimality results can be found in [8, 4]. For a summary of convergence and optimality results of AFEM 
we refer the reader to the survey [15] and the references therein. We restrict ourselves to those references 
strictly related to our work. 
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Well-posedness and finite element error estimates for problem (1) have been stated in [3]. A posteriori 
error estimators for nonconforming approximations have been developed in [16]. Linear convergence of 
an AFEM for the yi-Laplacian problem in a context of Sobolev-Orlicz spaces has been established in [5]. 
Recently, the (plain) convergence of an adaptive inexact FEM for problem (1) has been proved in [9], 
where only a discrete linear system is solved before each adaptive refinement. 

In this article we consider a standard adaptive loop of the form (2) based on classical residual-type 
a posteriori error estimators, where the Galcrkin discretization for problem (1) is considered. We use 
the Dorflcr's strategy for marking and assume a minimal bisection refinement. The goal of this paper 
is to prove the optimal complexity of this AFEM by stating two main results. The first one establishes 
the convergence of the adaptive loop through a contraction property More precisely, we will prove the 
following 

Theorem 1.1 (Contraction property). Let u be the weak solution of problem (1) and let {Uk}keN be 
the sequence of discrete solutions computed through the adaptive algorithm described in Section 4- Then, 
there exist constants < p < 1 and /i > such that 

[T{U k+1 ) - F{u)] + m 2 k+1 < p 2 ([F(U k ) - T{u)} + ht£), Vfc € N , 

where [J-(Uk) ~ J~(u)] is a notion equivalent to the energy error and Tjk denotes the global a posteriori 
error estimator in the mesh corresponding to the step k of the iterative process. 

The second main result shows that, if the solution of the nonlinear problem (1) can be ideally approx- 
imated with adaptive meshes at a rate (DOFs)~ s , then the adaptive algorithm generates a sequence of 
meshes and discrete solutions which converge with this rate. Specifically, we will prove the following 

Theorem 1.2 (Quasi-optimal convergence rate). Assume that the solution u of problem (1) belongs to 
As. 1 Let {7fe}fceN an d {ffc}fceN denote the sequence of meshes and discrete solutions computed through 
the adaptive algorithm described in Section 4-, respectively. If the marking parameter 9 in Dorfler's 
criterion is small enough (cf. (29) and (38),), then 

[HV(t4 - u)\\l + osc 2 Tk (U k )] h = O ((#% - #T )- S ) , Vfc e N. 

The left-hand side is called total error and consists of the energy error plus an oscillation term. 

Basically, we follow the steps presented in [2] for linear elliptic problems. However, due to the 
nonlinearity of problem (1) the generalization of the mentioned results is not obvious. In particular, the 
Galerkin orthogonality property (Pythagoras) 

\\V(U - u)f Q + \\V(U V)f n = \\V(V u)\\l, (3) 

where U is a discrete solution and V is a discrete test function, is used for linear elliptic problems in 
order to prove the contraction property and a generalized Cea's Lemma (the quasi-optimality of the total 
error), and does not hold when we consider problem (1). To overcome this difficulty wc resort to ideas 
from [5], replacing (3) by the trivial equality 

\HU) Hu)] + \HV) HU)] = \HV) Hu% 

where each term in brackets is equivalent to the corresponding term in (3) (cf. Theorem 4.1 below), and 
thus establish some kind of quasi- orthogonality relationship for the energy error (cf. Lemma 5.1) which 
is sufficient to prove the quasi-optimality of the total error (cf. Lemma 5.3). 

Additionally, it is necessary to study the behavior of the error estimators and oscillation terms when 
refining. In order to do that, we need to show that certain quantity, which measures the difference 
of error estimators and oscillation terms between two discrete functions (cf. (21)), is bounded by the 
energy of the difference between these functions (see Lemma 3.7 in Section 3.3). This result can be 
proved with usual techniques for linear elliptic problems using inverse inequalities and trace theorems, 
but the generalization of this result to nonlinear problems requires some new technical results. Wc 
establish suitable hypotheses on the main coefficient a of problem (1) to be able to prove the mentioned 
estimation for the nonlinear problems that we study in this article. 

1 Roughly speaking, u € A s if u can be approximated with adaptive meshes with a rate (DOFs)~ s (cf. (42) in Section 6). 
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This paper is organized as follows. In Section 2 we present specifically the problem that we study 
and some of its properties. In Section 3, we present a posteriori error estimations. In Section 4 we state 
the adaptive loop that we use for the approximation of problem (1) and we prove its linear convergence 
through a contraction property. Finally, the last two sections of the article are devoted to prove that the 
AFEM converges with quasi-optimal rate. 

2 Setting 

Let H C M. d be a bounded polygonal (d = 2) or polyhedral (d = 3) domain with Lipschitz boundary. A 
weak formulation of (1) consists in finding u G H^fl) such that 

a(u;u,v) = L{v), Vd G JJg(fi), (4) 

where 

a(w;u,v)= / a( ■ , \Vw\ 2 )\7u ■ Vu, Vw, u, v G H^n), 
Jo. 

and 

L(v)= [ fv, VveH^ft). 
Jn 

In order to make this presentation clearer, we define B : f2 x R + — > R + by 



1 rt 



P(x,t) := - I a(x,r) dr, 



o 

and note that from Leibniz's rule the derivative of (3 as a function of its second variable satisfies 

dB 

D 2 B{x,t) :=^-{x,t) = ta{x,t 2 ). 
at 

We require that a is C 1 as a function of its second variable and there exist positive constants c a and C a 
such that 

c a <-^(x,t)=a(x,t 2 ) + 2t 2 D 2 a{x,t 2 )<C a , G fi, t > 0. (5) 

Since a(x, t 2 ) = D ^l 3 ( x -' t )- D ^P( x ^ ) = ^^(x, r), for some < r < f the last assumption yields 

c a < a(x,t) < C a , Viell, t>0. (6) 

It is easy to check that the form a is linear and symmetric in its second and third variable. Additionally, 
from (6) it follows that a is bounded, 

|o(«;;ti J t;)|<C r ||Vu||n||Vt;||n I Vw, w, u G Hq(Q), (7) 

and coercive, 

c a || Vu||^2 < a(w; u, u), Ww,ueHq(Q). 

Now, we sketch the proof that (5) is sufficient to guarantee the well-posedness of problem (4). Let 
7 : Q, x R d -> R + be given by 

1 /-l«l 2 

7(x,f) := fi(x, = -J a(x,r) dr, 

and note that if V27 denotes the gradient of 7 as a function of its second variable, then 

V 2l (x,Z)=a(x,\t\ 2 )t, V.TGf>, £GK d . (8) 

Condition (5) means that D 2 B is Lipschitz and strongly monotone as a function of its second variable 
and it can be seen that V27 so is [21]. 
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If A : Hq(SI) — > (O) is the operator given by 

(Au, v) := a(u; u, v), Vu, v £ Hq (fi), 
problem (4) is equivalent to the equation 

Au = L. 

where L £ .H" _1 (f2) is given. It is easy to check that the properties of V27 are inherited by A, i.e., A is 
Lipschitz and strongly monotone. More precisely, there exist positive constants Ca and ca such that 

\\Au-Av\\ H - l(u) <C A \\V(u-v)\\n, Vu, v £ H^Sl), (9) 

and 

(Am — Av, u — v) > ca||V (u-v)\\l, Vm,«£H 1 (O). (10) 

As a consequence of (9) and (10), problem (4) has a unique stable solution [20, 21], which will be denoted 
throughout this article by u. 

3 Discrete solutions and a posteriori error analysis 
3.1 Discretization 

In order to define discrete approximations to problem (4) we will consider triangulations of the domain 
fi. Let 7o be an initial conforming triangulation of fi, that is, a partition of SI into d-simplices such that 
if two elements intersect, they do so at a full vertex/edge/face of both elements. Let us also assume that 
the initial mesh 7o is labeled satisfying condition (b) of Section 4 in Ref. [19]. Let T denote the set of all 
conforming triangulations of VL obtained from 7o by refinement using the bisection procedure described 
by Stevenson [19], which coincides, (after some re-labeling) with the newest vertex bisection procedure 
in two dimensions and the Kossaczky's procedure in three dimensions [17]. 
Due to the processes of refinement used, the family T is shape regular, i.e., 

diam(T) 

sup sup =: kj < 00, 

TeT Ter Pt 

where diam(T) is the diameter of T, and px is the radius of the largest ball contained in it. Throughout 
this article, we only consider meshes T that belong to the family T, so the shape regularity of all of 
them is bounded by the uniform constant kt which only depends on the initial triangulation 7o [17]. 
Also, the diameter of any element T £ T is equivalent to the local mesh-size Ht '■= IT] 1 ^, which in turn 
defines the global mesh-size Hj- := max Ht- Also, the complexity of the refinement can be controlled, 

as described in Lemma 6.3 below. 

Hereafter, we denote the subset of T consisting of neighbors of T by A/V(T) and the union of T and 
its neighbors in T by Uj-(T). More precisely, 

Af T (T) := {T' e T \ T'n7V0}, w r (T) := (J T' . 

For the discretization we consider the Lagrange finite element spaces consisting of continuous functions 
vanishing on dfl which are piecewise linear over a mesh T £ T, i.e., 

V r := {V £ ffo(fi) V\ T eVi(T), V T £ T}. (11) 

The discrete problem associated to (4) consists in finding U £ V7- such that 

a{U;U,V) = L(V), VV £ V r . (12) 

Note that the discrete problem (12) has a unique solution because A\ y is Lipschitz and strongly mono- 
tone (cf. (9)-(10)). 

At this point, it is important to remark that the discrete problem (12) is also nonlinear, and for our 
analysis we will assume that it can be solved exactly in every mesh T £ T. However, this assumption is 
usual even though in practice, even for discrete linear problems, we compute only approximations to the 
solution of discrete problems. The optimality of inexact methods has been studied for linear problems 
in [18, 14], and a generalization to nonlinear problems is subject of future work. 
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3.2 A posteriori error estimators 

In this section we present the a posteriori error estimators for the discrete approximation (12) of prob- 
lem (4) and state results showing their reliability and efficiency. These estimations will be useful in order 
to prove the optimality of the AFEM in Section 6. 
The residual of V G V7- is given by 



(R(V),v) := a(V;V,v) - L(v) 
Integrating by parts on each T G T we have that 



(R(n«) = E (7 rt{v)v+ I , 



where Rq-{V) denotes the element residual given by 

R T {V) W :=-V-[a(-,|W| 2 )W]-/, 

and J-j-(V) the jump residual given by 

1 



VT g r, 



J T (V) 



Is 



(q( • , \VV\ z )VV) Wi ■ n, + («( • , | W^W),^ • n 2 



(13) 



(14) 



for each interior side S, and Jj-(y)i := 0, if S is a side lying on the boundary of fi. Here, T\ and T2 
denote the elements of T sharing S, and n\ and n\ are the outward unit normals of T± and T2 on S, 
respectively. 

We define the local a posteriori error estimator rj-f(V; T) of V G V7- by 



r? T {V;T) := ff| pr^llr + #t || J r (V)|| 2 



VT e r, 



(15) 



and the global error estimator rjj-(V) by 



TET 

In general, if 3 C T we denote (Etgh T )) * b y Vt(V; S). 

The next lemma establishes a local lower bound for the error. Its proof follows the usual techniques 
taking into account that if u denotes the solution of problem (4), 

|<R(VV>| = \a(V;V,v) - L(v)\ = \a(V; V,v) - a(u;u,v)\ < C A \\W{V - u)\\ u \\Vv\\ u , 
for V G V7-, whenever v G Hq(£1) vanishes outside of ui, for any u C 0. 

Lemma 3.1 (Local lower bound). Let u £ Hq(SI) be the solution of problem (4). Let T G T and T G T 
6e/ixerf. //V G V r , 2 



^r(^; T) < || V(V - u) |L T(T) + i? T flr(V) - Rr(V) 



m 



wt(T) 



J r (V) - J r (^) 



(16) 



where Rj-(V)< / denotes the mean value of Rq-(V) on T' , for all T' G Afj-(T), and for each side S C dT, 
Jt{V)\ s denotes the mean value of Jf(V) on S. 

The last result is known as local efficiency of the error estimator. According to the lemma, if a local 
estimator is large, then so is the corresponding local error, provided the last two terms in the right-hand 
side of (16) are relatively small. 

We define the local oscillation corresponding to V G V7- by 



oscf(V; T) := R r (V) - R r {V) 



H 7 



J r (V) - J T (V) 



dT 



VTg T, 



2 From now on, we will write a < b to indicate that a < Cb with C > a constant depending on the data of the problem 
and possibly on shape regularity Kj of the meshes. Also a ~ b will indicate that a < b and b < a. 
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and the global oscillation by 

osc 2 T (V) := ^osc^T). 
TeT 

In gener al, if 3 C r we denote (Etgh osc^(V; T)) 2 by osc r (V"; S). 

As an immediate consequence of the last lemma, adding over all elements in the mesh we obtain the 
following 

Theorem 3.2 (Global lower bound). Let u 6 Hq(Q.) denote the solution of problem (4). Then, there 
exists a constant Cl = Cj,{d, kt,Ca) > such that 

C L if T {V) <\\V(V -u)\\l + osc 2 T {V), V7eV r , VTgT. 

We conclude this section with two estimations for the error, whose proofs arc strongly based on the 
analogous results for linear elliptic problems (cf. [2]). 

Theorem 3.3 (Global upper bound). Let u G ifg(f2) be the solution of problem (4). Let TeT and let 

U G V7- be the solution of the discrete problem (12). Then, there exists Cjj = Cjj(d, K,Y,C a ,c ai CA) > 
such that 

\\V(U-u)\\l<C uV 2 r (U). (17) 

Proof. Let u G H^(Vl) be the solution of problem (4). Let T G T and let U G V7- be the solution of the 
discrete problem (12). Let w £ Hq(H) be the solution of the linear elliptic problem 

a(U;w,v) = L(v), Vu e H%(Q). (18) 

Since A is strongly monotone (cf. (10)), using that u is the solution of problem (4), (18), and that a is 
bounded (cf. (7)), we have that 

c A \\V{U -u)\\ 2 n < (AU ~Au,U -u) =a(U;U,U-u)-a(u;u,U -u) 
= a(U; U,U -u)- a(U; w,U - u) = a(U; U -w,U - u) 
<C a \\V(U-w)\\ n \\V(U-u)\\ Q , 

and thus, 

||V(tf-«)|| n <^||V(tf-«0|| n . 

CA 

Since U is solution of the Galerkin discretization of the linear elliptic problem (18) in V7- (see (12)), 
using the reliability of the global error estimator for linear problems (cf. [2, Lemma 2.2]), it follows that 
there exists Cy — Cjj{d, kt, C a , c a , ca) > such that (17) holds. □ 

Theorem 3.4 (Localized upper bound). Let TeT and let 71 £ T be a refinement of T- Let TZ denote 
the subset ofT consisting of the elements which are refined to obtain 71, that is, 1Z := {T G 7~ \ T j£ 71}. 
Let U G V7- and U* G V7; be the solutions of the discrete problem (12) in V7- cmd V7;, respectively. 
Then, there exists a constant Clu = Ci,u{d, Hf,C a ,c a ,CA) > such that 

\\V(U-U*)\\ 2 ,<C LUV 2 T (U;K). (19) 

Proof. Let T, % , 1Z, U and [/» be as in the assumptions of the theorem. Let G V7; be the solution 
of the discrete linear elliptic problem 

a(U;W*,V*) =L(V*), VV»eV r ,. (20) 

Analogously to the last proof, using that A is strongly monotone, that U* is the solution of problem (12) 
in V7; , (20) and that a is bounded, we have that 

c A \\V(U -U*)\\l < (AU - AU„U -U*) = a{U;U,U - U,) - a(U*;U*,U - {/») 
= a(U; U,U- U*) - a(U; W*, U-U*) = a(U; U -W„U - U,) 
<C ||V(J7-W*)|| n ||V(?7-C/*)||n, 



G 



and therefore, 

||V([/-[/*)|| n < — \\V(U-W*)\\n. 

CA 

Finally, since U and W* are the solutions of the Galerkin discretization of the linear elliptic problem (18) 
in Vy and V7; , respectively (cf. (12) and (20)), using the localized upper bound for linear problems (cf. [2, 
Lemma 3.6]), it follows that there exists Clu = Cm{d, Kt, C a ,c a , ca) > such that (19) holds. □ 

3.3 Estimator reduction and perturbation of oscillation 

In order to prove the contraction property it is necessary to study the effects that refinement has upon 
the error estimators and oscillation terms. We thus present two main results in this section. The first 
one is related to the error estimator and it will be used in Theorem 4.2. 

Proposition 3.5 (Estimator reduction). Let T G T and let A4 be any subset of T ■ Let % G T be 

obtained from T by bisecting at least n > 1 times each element in M. IfVE V7- and V* G V7;, then 

Vt,( V *) < (1 + S ) Wt(V) - (1 - 2-5)t£(V;M)} + (1 + S-^CeIMV. - V)\\l, 

for all S > 0, where Ce > I is a constant (cf. Lemma 3.7 below). 

The second result is related to the oscillation terms. It will be used to establish the quasi- optimality 
for the error (see Lemma 5.3) and to prove Lemma 5.4 in the next section. 

Proposition 3.6 (Oscillation perturbation). Let TeT and let % G T be a refinement ofT. IfV& V7- 
and V* G V7; , then 

osc 2 r (V;Tn%) < 2osc^(K;Tn7;) + 2C E \\V(V* - V)\\l, 
where Ce > 1 is a constant (cf. Lemma 3.7 below). 

In order to prove Propositions 3.5 and 3.6 we observe that if we define for T E T and V, W € V7- 

g r (V,W;T) :=H t \\R t (V)-Rt(W)\\ t + H^ \\J T (V) - J T (W)\\ dT , (21) 
then from the definition of the local error estimators (15) and the triangle inequality it follows that 

r lT (W;T)<r /r (V;T)+g T (V,W;T), VT G T, (22) 

and analogously 

osc r {W;T) <osc T (V;T)+g r (V,W;T), VT G T. (23) 

After proving that gf(V, W; T) is bounded by ||V(V — W / )|| L j r (T) 5 the first terms on the right-hand sides 
of (22) and (23) may be treated as in [2, Corollary 3.4 and Corollary 3.5] for linear elliptic problems, 
respectively, and the assertions of Propositions 3.5 and 3.6 follow. On the other hand, while proving 
that gr(y, W; T) < ||V(V — W0|| wr (T) is eas Y f° r linear problems by using inverse inequalities and trace 
theorems, it is not so obvious for nonlinear problems. Therefore, we omit the details of the proofs of 
the last two propositions, but we prove the following lemma, which is the main difference with linear 
problems [2]. 

Lemma 3.7. Let T G T and let gj- be given by (21). Then, there holds that 

9t(V,W;T)<\\V{V-W)\\ Ut( t), VV,WeV T , VT G T. (24) 

Consequently, there exists a constant Ce > 1 which depends on d, kt and the problem data, such that 

Y t 9r(V,W;T)<C E \\V(V-W)\\l, W,^GV r . (25) 
TeT 

In order to prove Lemma 3.7, we define 

T v {x) := V 2 7(i,W(i)) =a(x, | W(cc)| 2 )W(a;), Vi G fi, (26) 
and prove first the following auxiliary result. 



7 



Lemma 3.8. Let T G 7~. Let D 2 j be the Hessian matrix of 7 as a junction of its second variable. If 

- D 2 2l (y,Oh < C^\x - y\, Vx,y G T, £ G K d , 

/or some constant C 1 > 0, then for all V, W G V\{T), there holds that 

\T v {x) - T w (x) - T v (y) + T w (y)\ < C 7 ||V(F - W0|U. (r) |x - y\, V x,y G T. 

Remark 3.9. Taking into account (8), we have that 

(D§7fo0)« = 2^20(0;, |£| a )&& + |£| 2 )%, 

for 1 < i,j '•< d, where Sij denotes the Kronccker's delta. In consequence, if a(-,t) and -D 2 a(-,i)i are 
Lipschitz on each T G 7o uniformly in t > 0, it follows that £^7(2;, £) is locally Lipschitz as a function 
of its first variable, i.e., there exists a constant C 1 > such that 

\\Dh(x,0-D 2 2l (y,0h < C^\x-y\, Vx,y£T, £ G M. d , 

for all T G To- In particular this holds for any T G T, T G T. 

Proo/ of Lemma 3.8. Let T G 7". Let V, W G 7>i(T) and x,y eT. Taking into account that V and W 
are linear over T, we denote v := W(x) = VV(y) and w := VW(a;) = VM / (y). Thus, we have that 

\T v (x) - T w (x) - Tv(y) + Fw(y)\ = |V 2 7(x, v) - V 2 j(x, w) - V 2 j(y,v) + V 2 7(y, w) 

= / [D 2 j(x, w + r(v — w)) — Z?27(y, w + r(v — w))] (v — w) dr 
Jo 

< C 7 \x - y||v - w|, 
which completes the proof of the lemma. 

We conclude this section with the proof of Lemma 3.7, where we use that 

Jt(Y)\s = \ ( T v Wl ■ fii + r V[T2 ■ n 2 ) , sen, 



□ 



Rt{v) w = -v-r v -f, 



and 



which is an immediate consequence of (26) and the definitions of the element residual (13) and the jump 
residual (14). 

Proof of Lemma 3. 7. Let T G T and let V, W G V r . Let T G T be fixed. 
["■"I By Lemma 3.8, for the element residual we have that 



\\R T (V) - R T (W)\\ T = ||V • OV - IV)|| T < Hf ||V ■ (T v -T 



W)\\L°°(T) 



< H£ sup 

x,y£T 
x=£y 



\r v (x)-r w (x)-r v (y) + r w (y)\ 
\x - y\ 



and thus. 



<Hj\\V(V-W)\\ L oo (T) = \\V(V-W)\\ T , 



H T \\R T {V) - R r (W)\\ T < \\V(V - W)\\ T . 



(27) 



PH Consider now the term corresponding to the jump residual. If S is a side of T which is interior to O 
and if T\ and T 2 are the elements sharing S, we have that 



||J r (^)-J r (W)||, 



i=l,2 



< 53 |(r v -r w ) h 



i=l,2 



< 



53 (ir T ' ||r v - rv|| Tj + #| ||v(r v - iwOlh) , 



i=l,2 
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where we have used a scaled trace theorem. Since V27 is Lipschitz as a function of its second variable, 
we have that 

\T v (x) - T w (x)\ = |V 27 (z, VV(x)) - V 27 (a;, VW(x))\ < \W(x) - VW(x)\, 
for x G Ti = 1,2), and therefore, 

\\V v -Y w \\ Ti < \\v(y-w)\\ Ti , * = 1,2. 

Using the same argument as in Q~], we have that ||V(ry — Fyr)!!^ < || V(V — W)||r<! for i = 1,2, and in 
consequence, 

ff| || J r (U) - JrCWOIIsr < l|V(y - WOLt(t)- (28) 
Finally, (24) follows from (27) and (28), taking into account (21). □ 

4 Linear convergence of an adaptive FEM 

In this section we present the adaptive FEM and establish one of the main results of this article (Theo- 
rem 4.2 below) which guarantees the convergence of the adaptive sequence. 

4.1 The adaptive loop 

We consider the following adaptive loop to approximate the solution u of problem (4). 

Adaptive Algorithm. Let 7o be an initial conforming mesh of O and let 9 be a 
parameter satisfying < 8 < 1. Let k = 0. 

1. U k :=SOLVE(T fc ). 

2. {vk(T)} T eT k ■= ESTIMATED, %). 

3. M k := MARK({ Vk (T)} T£Tk ,T kl 8). 

4. %+i := REFINE(T fe , M k ,n). 

5. Increment k and go back to step 1. 

Now we explain each module in the last algorithm. 

• The module SOLVE. This module takes a conforming triangulation T k of il as input argument 
and outputs the solution U k of the discrete problem (12) in T k ; i.e., U k G Y k := Vf k satisfies 

a(U k ;U k ,V) =L(V), V V G V*. 

• The module ESTIMATE. This module computes the a posteriori local error estimators rj k (T) of 
U k over % given by %(T) := r, Tk {U k -T), for all T G %, (see (15)). 

• The module MARK. Based on the local error estimators, the module MARK selects a subset M. k 
of 7fe, using an efficient Dorfler's strategy. More precisely, given the marking parameter 8 G (0, 1), 
the module MARK selects a minimal subset Ai k of T k such that 

Vk(M k ) >8r, k (T k ), (29) 

where Vk {M k ) = {EreM k ^k( T )Y ^ = (Eren ■ 

• The module REFINE. Finally, the module REFINE takes the mesh 71- and the subset M. k C T k as 
inputs. By using the bisection rule described by Stevenson in [19], this module refines (bisects) n 
times (where n > 1 is fixed) each element in Ai k . After that, with the goal of keeping conformity of 
the mesh, possibly some further bisections are performed leading to a new conforming triangulation 
7fc+i G T of which is a refinement of 7~ k and the output of this module. 

From now on, U k , {rj k (T)}TeT k i -Mk, T k will denote the outputs of the corresponding modules SOLVE, 
ESTIMATE, MARK and REFINE, when iterated after starting with a given initial mesh %■ 
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4.2 An equivalent notion for the error 

In order to prove a contraction property for the error of a similar AFEM for linear elliptic problems the 
well-known Galerkin orthogonality relationship is used(scc [2]). In this case, due to the nonlinearity of 
our problem, this property does not hold. We present an equivalent notion of error so that it is possible 
to establish a property analogous to the orthogonality (cf. (37) below). 
It is easy to check that J : — > K given by 

J(v):= f (A(rv),v)dr= f j(-,Vv) dx, \/v G H%(Sl), 
Jo Jn 

is a potential for the operator A. More precisely, if W is a closed subspace of H^ft), the following claims 
are equivalent 

• w G W is solution of 

a{w; to, v) = L(v), Vw G W, (30) 
where L(v) = J n fv, for v G Hq(Q). 

• w G W minimizes the functional J- : Hq(&1) — > R over W, where T is given by 

F(v):=J(v)-L(v)=["f{;Vv)-fvdx, v G Hq(Q). (31) 
Jn 

The following theorem states a notion equivalent to the (Sl)-crror. The proof follows the ideas 
used in [5] and uses that the Hessian matrix of 7, denoted by -0^7, is uniformly elliptic, i.e., 

ca\C\ 2 <D 2 2l (x,0C-C<c A \C\ 2 , V*GfU,CGR d . (32) 

This fact holds because V27 is Lipschitz and strongly monotone as a function of its second variable. 

Theorem 4.1. Let W be a closed subspace of H^n) and let J- be given by (31). If w G W satisfies (30), 
then 

y||V(« - w)\\l < F{v) - F{w) < ^f\\V(v - w)f a , Vv G W. 

Proof. Let W be a closed subspace of Hq (f2) and let w G W be the solution of (30). Let vSWbe fixed 
and arbitrary. For z G R, we define <fi(z) := (1 — z)w + zv, and note that 

4> (z) = v — w and V<j){z) = (1 — z)Vm + zVv. 

If we define V'( z ) :== ^ r (0( 2: ))j integration by parts yields 

F{v) - F{w) = if)(l) - -0(0) = V'(0) + / V"(^)(l-2)^- (33) 

Jo 

From (31) it follows that 

i>{z) = F{4>{z)) = / 7 (ar, V0(z)) da: - / /0(z) da:, (34) 



and therefore, in order to obtain the derivatives of ip we first compute ^(7(0;, V0(z))), for each x G O 
fixed. On the one hand, we have that 

— 7 (., V^(z)) = V 27 (-, V0(z)) • ^V0(^) = V 27 (-, V<^)) • V(« - w), 

and then 

9 2 



^ 7 (-, V4>(z)) = D| 7 (-, V^(z))V(« - «;) • V(t> - w ) 
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where D?,^ is the Hessian matrix of 7 as a function of its second variable. Thus, taking into account that 
4>"{z) = for all z G E, from (34) it follows that 



i/)"(z)= / D1j{x,V4>{z))V{v -w) ■ V(v-w) dx. (35) 
Jn 

Since w minimizes T over W, we have that tp'(0) = 0; and using (35), from (33) we obtain that 



F{v) - F{w) = ( ( D 
Jo Jn 

Finally, since D?,j is uniformly elliptic (cf. (32)) we have that 



2 7(x, V0(z))V(w — w) ■ V(i> — — z) dx dz. 

Jn 



y ||V(v - w)\\l <J n J n Dh{x, V0(*))V(w - «;) • V(« - t»)(l - «) da; dz < ^\\V(v - w)f Q , 

which concludes the proof. □ 
As an immediate consequence of the last theorem, 

y ||V([/ fc - U p )\\l < T(U k ) - F{U P ) < ^f\W(U k - U P )\\ 2 Q , Vk,p G N 0) k < p, (36) 
and the same estimation holds replacing U p by u, the exact weak solution of problem (4). 

4.3 Convergence of the adaptive FEM 

Recall that u denotes the exact weak solution of problem (4), and Uk, {rjk(T)}T£T k j A4fc, Tk will denote 
the outputs of the corresponding modules SOLVE, ESTIMATE, MARK and REFINE of the Adaptive 
Algorithm when iterated after starting with a given initial mesh 7o- 

Taking into account the estimator reduction (Proposition 3.5), the global upper bound (Theorem 3.3) 
and (36) , we now prove the following result which establish the convergence of the Adaptive Algorithm. 

Theorem 4.2 (Contraction property). There exist constants < p < 1 and fi > which depend on 
d, kj, of problem data, of number of refinements n performed on each marked element and the marking 
parameter 6 such that 

muk+i) - ^(«)] + ml+i < p 2 mu k ) - r(u)] + w*), vfc g No, 
1 

where rjk '■= (X^TeTfc n k(T)) 2 denotes the global error estimator in Tk- 
Proof. Let k G No, using that 

F(Uk) - ?(u) - T(U k ) - HUk+i) + HUk+i) - Hv), (37) 
and the estimator reduction given by Proposition 3.5 with T = Tk and Tl = Tk+i we have that 

[T{u k+ i) - ^(«)] + wl+i < lHu k ) - Hu)} - \HUk) - HUk+x)] 

+ (1 + 5)n {4 - ti£(M k )} + (1 + S-^CEfAWiUk - u k+l )\\l, 

for all <5,/i > 0, where £ := 1 - and rj 2 k {Mk) ■= J2reM k 7 ?fc( T )- % choosing fj, := 2 (i + g-i)c E > and 
using (36) it follows that 

[F{U k+ i) - T{u)] + < [F{U k ) - T{u)} + (1 + 8)p {rf k - £4(M k )} . 

Dorncr's strategy yields rjk(Aik) > Qf]k and thus 

[T{U k+ i) - T(u)] + < [HUk) - ?{u)] + (1 + S) f irf k - (1 + S)tfPr& 

= [HUk) Hu)] + (i + 8)? (1 4 (1 + 5)^4. 
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Using (36), the global upper bound (Theorem 3.3) and that (1 + S)p = |^ it follows that 
\HUk + i) F(u)] + < [HUk) F(u)] + (1 + 6)vl (l -^Pjvt- 2 c A u CEC A [HUk) ~ Hl 



If we define 
we thus have that 

[F(U k+1 ) - F(u)] + M^fc+i < Pi(5)[^(C/ fc ) - .F(u)] + W>I(<5)r?fc- 

The proof concludes choosing 5 > small enough to satisfy 

< p := max{p 1 (S),p 2 (S)} < 1. □ 

The last result, coupled with (36) allows us to conclude that the sequence {Uk}ken a °f discrete 
solutions obtained through the Adaptive Algorithm converges to the weak solution u of the nonlinear 
problem (4), and moreover, there exists p £ (0, 1) such that 

\\V(U k -u)\\ n <Cp k , VfceNo, 

for some constant C > 0. Also, the global error estimators {?7fc}feeN tend to zero, and in particular, 

Vk<C P k , VfcGNo, 

for some constant C > 0. 



5 Optimality of the total error and optimal marking 

In this section we introduce the notion of total error, we show an analogous of Cea's lemma for this 
new notion (see Lemma 5.3) and a result about optimal marking (see Lemma 5.4). Both of them will 
be very important to establish a control of marked elements in each step of the adaptive procedure (cf. 
Lemma 6.2 in Section 6). 

We first present an auxiliary result that will allow us to show the analogous of Cea's lemma for the 
total error. Its proof is an immediate consequence of Theorem 4.1 and will thus be omitted. 

Lemma 5.1 (Quasi-orthogonality property in a mesh). If U 6 V7- denotes the solution of the discrete 
problem (12) for some TeT, then 

\MU u)\\ 2 Q + \MU V)\\ 2 Q < ^\MV - u)\\l WeY T , 

CA 

where Ca and ca are the constants appearing in (9) and (10). 

Since the global oscillation term is smaller than the global error estimator, that is, oscr^) < ifr(U), 
using the global upper bound (Theorem 3.3), we have that 

\\V(U-u)\\l + osc 2 T (U) < (Cu + l) v 2 r (U), 

whenever u is the solution of problem (4) and U G V7- is the solution of the discrete problem (12). 
Taking into account the global lower bound (Theorem 3.2) we obtain that 

rtr(U)* {\\V(U-u)\\l + osc 2 T (U)y . 

The quantity on the right-hand side is called total error, and since adaptive methods are based on the a 
posteriori error estimators, the convergence rate is characterized through properties of the total error. 
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Remark 5.2. (Cea's Lemma) Taking into account that A is Lipschitz and strongly monotone, it is easy 
to check that 

!|V(C/- U )|h<— inf ||V(V-tO|| n . 

CA VGY T 

This estimation is known as Cea's Lemma and shows that the approximation U is optimal (up to a 
constant) of the solution u from V7-. 

A generalization of Cea's Lemma for the total error is given in the following 

Lemma 5.3 (Cea's Lemma for the total error). If U £ Vf denotes the solution of the discrete prob- 
lem (12) for some T G T, then 

|| V(U - u)f n + osc 2 T (U) < inf (||V(V - u)\\ 2 a + osc 2 r (V)), 

CA V£Vt 

where Ce > 1 is the constant given in (25). 

Proof. Let T £ T and let U £ V7- be the solution of the discrete problem (12). If V £ ¥7-, using 
Proposition 3.6 with 71 = T and Lemma 5.1 we have that 

||V(C/ - u)\\l + osc 2 T (U) < \\V(U - u)\\l + 2os$-(V) + 2C E \\V(V - U)\\ 2 n 

< 2C E — \\V(V ~u)\\l + 2osc 2 T (V) 

CA 

£ ^ECA {lMv _ unl + Qsc?riv) y 
CA 

Since V £ V7- is arbitrary, the claim of this lemma follows. □ 

The following result establishes a link between nonlinear approximation theory and AFEM through 
Dorfler's marking strategy. Roughly speaking, it is a reciprocal to the contraction property (Theo- 
rem 4.2). More precisely, we prove that if there exists a suitable total error reduction from T to a 
refinement T*, then the error indicators of the refined elements from T must satisfy a Dorfler's prop- 
erty. In other words, Dorfler's marking and total error reduction are intimately connected. This result is 
known as optimal marking and was first proved for linear elliptic problems by Stevenson [18]. The notion 
of total error presented above was first introduced by Cascon et al. [2] for linear problems, together with 
the appropriate optimal marking result, which we mimic here. 

In order to prove the optimal marking result we assume that the marking parameter 9 satisfies 



< 9 < O := 



1 + 2C LU (1 + C E ) 



(38) 



where Cl, Clu are the constants appearing in the global lower bound (Theorem 3.2) and in the localized 
upper bound (Theorem 3.4), respectively, and Ce is the constant appearing in (25). 

Lemma 5.4 (Optimal marking). Let T £ T and let T* £ T be a refinement of T . Let 1Z denote the 
subset ofT consisting of the elements which were refined to obtain 71, i.e., 1Z = 7~\7l. Assume that the 
marking parameter 9 satisfies < 9 < 9q and define v := ^(l — |^) > 0. Let U and {/* be the solutions 
of the discrete problem (12) in V7- andW%, respectively. If 

\WU. - u)\\l + osc^t/*) < v (||V(C/ - u)\\l + osc 2 T (U)) , (39) 

then 

r,r(U;n)>er, r (U). 

Proof. Let T, %, TZ, U, E/*, 9 and v be as in the assumptions. Using (39) and the global lower bound 
(Theorem 3.2) we obtain that 

(l-2u)C L rfr(U) < (l-2^)(||V([/-u)|| 2 , + os C 2-(C/)) 

< l|V(t/ - u)\\l - 2||V(tf. - u)\\l + osc 2 T {U) - 2os4; ([/*). (40) 
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Since \\V(U - u)\\ n < \\V(U, - u)\\ a + ||V(J7* - U)\\ n , we have that 

||V([/ - u)f Q - 2||V(tf. - u)f Q < 2||V(tf. - COIIn- (41) 

Using Proposition 3.6 and that osc^-(C7 ; T) < r]j-(U;T), if T G 7£ = T \ 7*; for the oscillation terms we 
obtain that 

osc 2 T (U) - 2os4;([/») < 2C E ||V(t/* - U)\\l + r^JJ\H). 
Taking into account (41) and the last inequality, from (40) it follows that 

(1 - 2v)C L n 2 r {U) < 2||V(£7 - U*)\\l + 2C E \\V(U - E/*)||n + rfr(U;K), 

and using the localized upper bound (Theorem 3.4) we have that 

(1 - 2v)C m 2 r (U) < 2(1 + C E )C LU n 2 T {U; K) + if T {U; K) = (1 + 2C LU (1 + C E ))r, 2 T {U; 11). 

Finally, 

-Vt(U)<Vt(U;U), 



1 + 2C LU (1 + C E ) 

which completes the proof since i + 2c^^(i+c g ) = 0- — ^ v )®v = ^ ^ ^ ne definition of v. □ 

6 Quasi-optimality of the adaptive FEM 

In this section we state the second main result of this article, that is, the adaptive sequence computed 
through the Adaptive Algorithm converges with optimal rate to the weak solution of the nonlinear 
problem (4). For N E No, let Tat be the set of all possible conforming triangulations generated by 
refinement from 7o with at most N elements more than 7o, i.e., 

Tjv:={TeT| #T-#T <iV}. 

The quality of the best approximation in Tjv is given by 

°n(u):= inf inf [\\V(V - u)\\ 2 n + osc 2 T (V)] k . 



For s > 0, we say that u 6 A s if 



sup {(N + l)Vjv(u)} < oo. (42) 
JVeNo 



In other words, u belongs to the class A s if can be ideally approximated with adaptive meshes at a rate 
(DOFs)~ s . From another perspective, if u € A s , then for each e > there exist a mesh T £ G T and a 
function V e € V7; such that 

#7;-#75<H;e-- and \\V(V £ - u)\\ 2 Q + osc 2 % (V s ) < e 2 . 

The study of classes of functions that will yield such rates is beyond the scope of this article. Some 
results along this direction can be found in [1, 10, 11]. 

The following result proved in [18, 2], provides a bound for the complexity of the overlay of two 
triangulations T 1 and T 2 obtained as refinements of 7o- 

Lemma 6.1 (Overlay of triangulations). For T 1 , T 2 G T the overlay T :— T 1 © T 2 G T, defined as the 
smallest admissible triangulation which is a refinement of F 1 and F 2 , satisfies 

#T<#T 1 + #T 2 -#To. 

The next lemma is essential for proving the main result below (see Theorem 6.4). 
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Lemma 6.2 (Cardinality of M.}.). Let us assume that the weak solution u of problem (4) belongs to A s . 
// the marking parameter 9 satisfies < 9 < 9q (cf. (38) ), then 

i 

#M k < ( V S \u\S [\MU k -u)\\ 2 n + osc 2 Tk (£4)]~ A , VfceN , 

Proof. Let fe £ No be fixed. Let £ = e(fc) > be a tolerance to be fixed later. Since it £ A s , there exist 
a mesh 7^ £ T and a function £ V7; such that 

#7: - #T < |u|.'e-i and ||V(V e - u)f Q + osc 2 % (V £ ) < e 2 . 

Let % := T e © 7fe the overlay of 7j and 71- (cf. Lemma 6.1). Since 14 £ V7;, we have that 
0SC7; (V e ) > 0SC7; (V £ ), and from Lemma 5.3, if f7* £ V7; denotes the solution of the discrete problem (12) 
in Vr» , we obtain that 

l|V(£/* - u)||S + OBcf.(U*) < 2C E ^ (|| V(y e +osc^(T4)) < 2C E ^e 2 . 

CA C A 

Let e be such that 

||V([/* - u)\\l + os4.(CT.) < 1; (||V(t4 - u)\\l + osc 2 n (U k )) = 2C E ^e 2 , 

CA 

where v is the constant given by Lemma 5.4. Thus, this lemma yields 

Vr k (U k ;K k ) >0 Vn (U k ), 

if lZ k denotes the subset of T k consisting of elements which were refined to get 71 ■ Taking into account 
that M. k is a minimal subset of T k satisfying the Dorflcr's criterion, using Lemma 6.1 and recalling the 
choice of e we conclude that 

#M k < #K k < #% - #% < #T e - #T < \u\le~i 

1 

= (^f^) 23 nl(\\V(U k -u)\\l +osc^ ([/,))'*. □ 

The next result bounds the complexity of a mesh T k in terms of the number of elements that were 
marked from the beginning of the iterative process, assuming that all the meshes were obtained by the 
bisection algorithm of [19], and that the initial mesh was properly labeled (satisfying condition (b) of 
Section 4 in [19]). 

Lemma 6.3 (Complexity of REFINE). Let us assume thatl^ satisfies the labeling condition (b) of Section 
4 in Ref. [19], and consider the sequence {7fe}fceN of refinements of To where T k +i ■= REFINE(T k , A4 k ,n) 
with A4 k C Tk- Then, there exists a constant Cs > solely depending on 7o and the number of 
refinements n performed by REFINE to marked elements, such that 

fc-i 



#T fe - #T < C s J2 *Mi, for all k £ N. 



i=0 

The next result will use Lemma 6.3 and is a consequence of the global lower bound (Theorem 3.2), 
the bound for the cardinality of Ai k given by Lemma 6.2 and the contraction property of Theorem 4.2. 
This is the second main result of the paper. 

Theorem 6.4 (Quasi-optimal convergence rate). Let us assume that 7o satisfies the labeling condition 
(b) of Section 4 in Ref. [19]. Let us assume that the weak solution u of problem (4) belongs to A s . If 
{[/fc}fegN denotes the sequence computed through the Adaptive Algorithm, and the marking parameter 9 
satisfies < 9 < 9 (cf. {38)), then 

[\MUk-u)\\ 2 n +080^)]* <CM s (#T fc -#T )- s , Vfc£N, (43) 

where C > depends on d, Kt, problem data, the number of refinements n performed over each marked 
element, the marking parameter 9, and the regularity index s. 
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Proof. Let k G N be fixed. The global lower bound (Theorem 3.2) yields 

\MU t - u)\\l + WtM) < C 1 + vC'l 1 ) [\MU t - u)\\l + osc^(^)] . < i < k - 1, 
where fi is the constant appearing in Theorem 4.2. Using Lemmas 6.3 and 6.2 it follows that 



k— i ✓ 1 k— i 

(2C E C A \^ 



#T* - #To <C S Y, #Mi < C s \u\] J2 [H y (^ - u )Wn + osc^f/i)] 





fc-1 



< c s f^£A 2S \ u \l (i + ^l 1 ) * Yl [\M U i - «)lln + prfcmr*- (44) 

Since we do not have a contraction for the quantity [||V(J7, — u )\\n + W^t^i)] as happens in the linear 
problem case, we now proceed as follows. We define zf := [F{Ui) — J-(u)] + /j,rjj-.(Ui), the contraction 

property (Theorem 4.2) yields Zi+i < pz{ and thus, z i 3 < p^ z i+ \. Since p < 1, taking into account (36), 
we obtain that 3 

k — 1 k— 1 oo 

E(IIV(^-")II^ + M^(^))"* < (Ci/2)&5>r* < (C A /2)*X>*)V* 

l 

1 -p- 

1 

<(C A c^)A-^(||V([/ fc - U )||2 +^(C/ fc ))-*. 

1 

Using the last estimation in (44), it follows that 

#r fc - #r < c s (-9e9±\ 23 H |(i + A (CAC -i)i^l_ (|| V (t7 fc - U )||2 + m 2 rk {u k )Y^ , 

\ / 1 — p'> 

and using that oscj- k (Uk) < ifr k {Uk) and raising to the s-power we have that 

i 

(#% #%y < ^ ( — ) 2 (i + »c- L x y p \ u \ s (\\v(u k - U )\\ 2 Q +^ Th {u k ))-K 

Finally, from this last estimation the assertion (43) follows, and the proof is concluded. □ 

We conclude this article with a few remarks. 
Remark 6.5. The problem given by (1) is a particular case of the more general problem 

J -V- [a(-,\Vu\ 2 A )AVu] = / infi 

u = on<9f2, 

where a : il x M + — » K + and / g L 2 (£l) satisfy the properties assumed in the previous sections, and 
A : f2 — > M. dxd is such that A(x) is a symmetric matrix, for all x S f2, and uniformly elliptic, i.e., there 
exist constants a, a > such that 

a|£| 2 <-4(:r)£-£ <a|£| 2 , V x G 0, £ e R d . 

If .4 is piccewisc constant over an initial conforming mesh 7o of f2, then the convergence and optimality 
results previously presented also hold for this problem. 

Remark 6.6. We have assumed the use of linear finite elements for the discretization (see (11)). It is 
important to notice that the only place where we used this is for proving (25). The rest of the steps of 
the proof hold regardless of the degree of the finite element space. The use of linear finite elements is 
customary in nonlinear problems, because they greatly simplify the analysis. 



In this estimation we assume for simplicity that ca and Ca are chosen so that < 2 < Ca- 
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